Cluster Assignment Strategies for a Clustered Trace Cache Processor

نویسندگان

  • Ravi Bhargava
  • Lizy K. John
چکیده

This report examines dynamic cluster assignment for a clustered trace cache processor (CTCP). Previously proposed clustering techniques run into unique problems as issue width and cluster count increase. Realistic design conditions, such as variable data forwarding latencies between clusters and a heavily partitioned instruction window also increase the degree of difficulty for effective cluster assignment. In this report, the trace cache and fill unit are used to perform effective dynamic cluster assignment. The retire-time fill unit analysis is aided by a dynamic profiling mechanism embedded within the trace cache. This mechanism provides information on inter-trace trace dependencies and critical inputs, elements absent in previous retire-time CTCP cluster assignment work. The strategy proposed in this report leads to more intra-cluster data forwarding and shorter data forwarding distances. In addition, performing this strategy at retire-time reduces issuetime complexity and eliminates early pipeline stages. This increases overall performance for the SPEC CPU2000 integer programs by 8.4% over our base CTCP architecture. This speedup is significantly higher than a previously proposed retire-time CTCP assignment strategy (1.9%). Dynamic cluster assignment is also evaluated for several alternate cluster designs as well as media benchmarks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rule Description for Clustered Disks to Make Asynchronous Backup and Recovery Flexible

Current disk drives contain a control processor and a large amount of memory for the disk cache, which enable the disk drives to implement high functionality within them. We propose autonomous disks in order to utilize the functionality for balancing load, tolerating faults, and recovering failures. A set of the autonomous disks configures a cluster in a network. A distributed directory structu...

متن کامل

The Performance Value of Shared Network Caches in Clustered Multiprocessor Workstations

This paper evaluates the bene t of adding a shared cache to the network interface as a means of improving the performance of networked workstations con gured as a distributed shared memory multi processor A cache on the network interface shared by all processors on each cluster o ers the potential bene ts of retaining evicted processor cache lines providing implicit prefetching when network cac...

متن کامل

Speculative Clustered Caches for Clustered Processors

Clustering is a technique for partitioning superscalar processor’s execution resources to simultaneously allow for more in-flight instructions, wider issue width, and more aggressive clock speeds. As either the size of individual clusters or the total number of clusters increases, the distance to the first level data cache increases as well. Although clustering may expose more parallelism by al...

متن کامل

Memory Data Organization for Improved Cache Performance in Embedded Processor Applications PREETI RANJAN PANDA and NIKIL

Code generation for embedded processors opens up the possibility for several performance optimization techniques that have been ignored by traditional compilers due to compilation time constraints. We present techniques that take into account the parameters of the data caches for organizing scalar and array variables declared in embedded code into memory, with the objective of improving data ca...

متن کامل

Achieving Load Balance and Eeective Caching in Clustered Web Servers

This paper considers policies for distributing requests in clustered Web servers, wherein multiple server machines are conngured to function as a single high(er) performance Web server. We evaluate various load distribution policies with respect to both their ability to achieve good load balance (the primary goal) and also to their impact on the eeec-tiveness of per-machine caching. Trace-drive...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003